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REMARKS 

I. Status of the Claims/ Amendments 

Claims 1-24, 27-35, 43, 44, 48-54 and 58-68 are pending. Claims 10, 21, 27, 49, 
59, 62, 64, and 68 are currently amended. 

The Examiner noted that claim 24 on page 5 of the previous amendment was 
listed as both "currently amended" and "cancelled". Li the application as filed, there 
were two claims numbered 24. The first was amended, the second was cancelled and 
presented as new claim 68. 

SEQ ID NO: 6 has been resubmitted to correct an error in the originally filed 
listing. As indicated in Table 5, SEQ ID NO: 6 is 85 residues long, fi-om 756-840 of SEQ 
ID NO: 1. Also, the transposed amino acid Hstings in SEQ ID NO: 7 and SEQ ED NO: 8 
have been corrected to conform to the corresponding portions of SEQ ID NO: 1 . 

II. Provisional Double Patenting Rejection 

Applicant requests clarification regarding the statement that SEQ ID NO: 1, 2, 4, 
5, 7 and 8 of apphcation 09/917,384 are 100% identical with the same sequences of the 
current application. The applicant has reviewed both sets of sequence listings and finds 
them to be different. The sequences of both applications are included as an appendix to 
this response. Since the claims address different sequences, it is respectfiiUy requested 
that the Office withdraw the provisional double patenting rejection of claims 1-24, 27-35, 
43,44, 58-54 and 58-68. 

III. Rejection Under 35 U.S.C. S 101 

Claims 22-24 and 68 stand rejected because the Office asserts that the claimed 
invention is not supported by either a specific substantial asserted utility or a well 
established utility, and is not shown to be enabled. Applicant, however, contends that 
several utilities are asserted and supported by the specification. The instant invention is a 
novel cellulase fi-om Acidothermus cellulolyticus, GuxA (p.l, line 10), which is "in 
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particular a thermal tolerant glycoside hydrolase useful in the degradation of cellulose" 
(p. 4, lines 8-9). As discussed on p. 3, lines 27-29, cellulase enzymes are capable of 
assisting in the commercial-scale processing of cellulose to sugar for use in biofuel 
production. Further support for the usefulness of GuxA may be found on: (p. 5, lines 6- 
9) GuxA may be "administered to a biomass containing cellulose for the reduction or 
degradation of the cellulose"; (p. 17, lines 26-30) "Like other members of the cellulase 
family, and in particular thermostable cellulases, GuxA polypeptides are useful in the 
conversion of biomass to biofuels and biofuel additives, and in particular, biofuels from 
cellulose. It is envisioned that GuxA polypeptides could be used for other purposes, for 
example in detergents, pulp and paper processing, food and feed processing, and in textile 
processes"; (p.27, lines 8-9) "Compositions of the present invention can be used in 
stonewashing jeans such as is well known in the art"; (p. 27, lines 15-16, regarding 
pharmaceutical compositions) "For example, GuxA polypeptides may be used where 
carbohydrate-mediated liposomal interactions are involved with target cells." 

Contrary to the Office assertion, the GH6 and GH12 family hydrolases do have 
well established and well known utilities, respectively, as the exoglucanases and 
endoglucanases reported on page 17 at lines 9-18 of the specification. A family 
comparison with the asserted catalytic domain utility is set forth in Example 2 beginning 
on page 33. These items are sufficient to assert utility and show operabiUty according to 
expectations for the GH6 and GH12 family. For these reasons, Applicant's attorney 
respectfully requests withdrawal of the rejection. 

IV. Rejection Under 35 U.S.C. S 112, First Paragraph 

Claims 22-24 and 68 stand rejected with the Office asserting that "[sjince the 
claimed invention is not supported by either a asserted utility or a well established utility 
for the reasons set forth [in the rejection under 35 U.S.C. §101], one skilled in the art 
clearly would not know how to use the claimed invention." 

As discussed above, numerous utilities for the present invention were asserted in 
and supported by the specification. Furthermore, the specification states that enzymatic 
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hydrolysis of biomass (p. 2, lines 9-10) and the use of cellulase in the stonewashing of 
jeans (p. 27, lines 8-9) are well known. Therefore, one of ordinary skill in the art would 
know how to use the cellulase claimed herein in at least those capacities. For these 
reasons, Applicant's attorney respectfully requests withdrawal of the rejection. 

V. Rejection Under 35 U.S.C, § 112, Second Paragraph 

Claims 10, 20, 21, 49, 59, 62, 64 and 68 were rejected as indefinite for failing to 
particularly point out and distinctly claim the subject matter which applicant regards as 
the invention. The claims have been amended as per Examiner's suggestions. Claim 10 
is now directed to "CBD U"; claims 20 and 21 recite "amino acid sequences"; claim 21 
has been amended to reflect that the amino acid sequence is encoded by the nucleic acid 
sequence; claim 49 depends from claim 48; the word "exposed" has been removed from 
claim 59; claim 62 now depends from claim 59; claim 64 now depends from claim 63; 
and "comprises-a" has been changed to "comprises a" in claim 68. No new matter has 
been added to the claims as amended. 

VI. Rejection Under 35 U.S.C. S 102 

Claims 27, 35 and 43 stand rejected under 35 U.S.C. § 102(b) as being anticipated 
by Sakon et al Sakon teaches an amino acid sequence that is 72.1% identical with SEQ 
ID NO: 8; however, Sakon describes a GH5 family enzyme (PAGE 10649, first column, 
first full paragraph). Claim 27 has been amended to recite the functionality of at least 
one of family GH6 and GH12, and this distinguishes Sakon. Claims 35 and 43 depend 
from claim 27 and should likewise be allowable. 

VII. Rejection Under 35 U.S.C. S 102/103 

Claims 1-21, 27-35, 43, 48, 54 and 58-67 stand rejected under 35 U.S.C. § 102(b) 
as anticipated by or, in the alternative, under 35 U.S.C. § 103(a) as obvious over either of 
Tucker et al, Adney et al, Himmel et al, Lastick et al or Barker et al The references 
each teach a thermostable cellulase from Acidothermus cellulolyticus. The Office invites 
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Applicants to provide convincing proof that the cellulases of the given references are not 
the same as the instant enzyme. 

The distinction can be shown by molecular weight comparison. The molecular 
weight of GUXA is 124,597 Daltons. This compares to Endo I and n having respective 
molecular weights of 57,420 to 74,580 as reported in Adney et al 5,366,844 (column 7, 
lines 61-62); 52,500 Daltons as reported in Himmel et al. 5,432,075 (see column 7, line 
27 and abstract); and 156,600 to 203,400 Daltons as reported in Tucker et al. 5,1 10,735 
(Examples 7-8). 

Li addition to the foregoing, Adney et al. 5,366,884 at column 10, lines 1-2 shows 
a sequence: 

ASSDTGLFSASPTYDGVYRQGLAVAGLIAAGQHPAAXAIDXLL 

This sequence cannot be found in the reported sequence for GUXA which, therefore, is 
not shown in US 5,366,884. 

As to Lastick et al., US 5,514, 584 references the cellulases described in Tucker et 
al. 5,1 10,735 (column 2, line 66 to column 3 at line 2). In like manner, Baker reports 
using only "endo 1," which is the same as Endo I reported in US 5,275,944 (see page 15, 
4^*" full paragraph, chart on page 1216 denoting "A.c, El" and footnote 31). The '944 
patent is a companion case - a CZP- to US 5,1 10,735 (discussed above) and is directed 
towards the 57,420 to 74,580 Dalton Endo 1 that is also described in US 5,1 10,735 

The foregoing evidence is sufficient to show that the claimed GUXA is distinct 
because none of the reported molecular weights is even arguably 124,597 Daltons. 
AppKcant's attorney respectfully solicits withdrawal of the rejection. 

VIIL Conclusion 

Applicants' attorney respectfully solicits a Notice of Allowance in this apphcation. 
The Commissioner is authorized to charge any additionally required fees to deposit 
account 14-0460. Should the Examiner have any questions, comments, or suggestions 
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that would expedite the prosecution of the present case to allowance, Applicants' 
undersigned representative earnestly requests a telephone call at (303) 384-7575. 



National Renewable Energy Laboratory 
1617 Cole Boulevard 
Golden, CO. 80401 
(303) 384-7575 



CERTIFICATE OF MAILING UNDER 37 CFR § L8 

I hereby certify that the following attached items: 

1 . Amendment [ 1 5 pages] 

2. Statement to Support Filing and Submission in Accordance with 37 CFR 
1.821-1.825 

3. Replacement Figure 2 

4. Marked-Up Sequence Listing [22 pages] 

5. Substitute Sequence Listing [ 1 8 pages] 

6. Petition and Fee for Extension of Time Under 3 7 CFR 1 . 1 36 and 1 . 1 7(a)(2) 

7. Postcard receipt 

are being deposited in the United States Postal Service as first class mail, postage pre- 
paid, in an envelope addressed to: Commissioner for Patents-^S. Patent & Trademarl 
Office, P.O. Box 1450, Alexandria, Va 22313-1450 on this^ day of December 2003. 



Respectfully Submitted, 




Senior Patent Counsel 




Paul J. White 
Senior Patent Counsel 
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MARKED-UP SEQUENCE LISTING 

<110> DING, SHI-YOU 
ADNEY, WILLIAM S. 
VINZANT, TODD B. 
DECKER, STEPHEN R. 
HIMMEL, MICHAEL E. 

<120> THERMAL TOLERANT CELLULASE FROM ACIDOTHERMUS 
CELLULOLYTICUS 

<130> 40197.6WQO M Q17Q 6USni 

<140> 09/917,7383 
<1 41 > 2001-07-28 

<160> 14 

<170> Patentin VersionT32.1 

<210> 1 
<211> 1228 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Segment of 
GuxA 

<400> 1 

Met Glu Arg Thr Gin Gin Ser Gly Arg Asn Cys Arg Tyr Gin Arg Gly 
15 10 15 

Thr Thr Arg Met Pro Ala lie Ser Lys Arg Leu Arg Ala Gly Val Leu 
20 25 30 

Ala Gly Ala Val Ser lie Ala Ala Ser lie Val Pro Leu Ala Met Gin 
35 40 45 

His Pro Ala lie Ala Ala Thr His Val Asp Asn Pro Tyr Ala Gly Ala 
50 55 60 

Thr Phe Phe Val Asn Pro Tyr Trp Ala Gin Glu Val Gin Ser Glu Ala 
65 70 75 80 

Ala Asn Gin Thr Asn Ala Thr Leu Ala Ala Lys Met Arg Val Val Ser 
85 90 95 

Thr Tyr Ser Thr Ala Val Trp Met Asp Arg lie Ala Ala He Asn Gly 
100 105 110 

Val Asn Gly Gly Pro Gly Leu Thr Thr Tyr Leu Asp Ala Ala Leu Ser 
115 120 125 

Gin Gin Gin Gly Thr Thr Pro Glu Val lie Glu He Val lie Tyr Asp 
130 135 140 
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Leu Pro Gly Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Leu Pro 
145 150 155 160 

Ala Thr Ala Ala Gly Leu Gin Thr Tyr Glu Thr Gin Tyr lie Asp Pro 
165 170 175 

He Ala Ser He Leu Ser Asn Pro Lys Tyr Ser Ser Leu Arg He Val 
180 185 190 

Thr He He Glu Pro Asp Ser Leu Pro Asn Ala Val Thr Asn Met Ser 
195 200 205 

He Gin Ala Cys Ala Thr Ala Val Pro Tyr Tyr Glu Gin Gly He Glu 
210 215 220 

Tyr Ala Leu Thr Lys Leu His Ala He Pro Asn Val Tyr lie Tyr Met 
225 230 235 240 

Asp Ala Ala His Ser Gly Trp Leu Gly Trp Pro Asn Asn Ala Ser Gly 
245 250 255 

Tyr Val Gin Glu Val Gin Lys Val Leu Asn Ala Ser He Gly Val Asn 
260 265 270 

Gly He Asp Gly Phe Val Thr Asn Thr Ala Asn Tyr Thr Pro Leu Lys 
275 280 285 

Glu Pro Phe Met Thr Ala Thr Gin Gin Val Gly Gly Gin Pro Val Glu 
290 295 300 

Ser Ala Asn Phe Tyr Gin Trp Asn Pro Asp He Asp Glu Ala Asp Tyr 
305 310 315 320 

Ala Val Asp Leu Tyr Ser Arg Leu Val Ala Ala Gly Phe Pro Ser Ser 
325 330 335 

He Gly Met Leu He Asp Thr Leu Arg Asn Gly Trp Gly Gly Pro Asn 
340 345 350 

Glu Pro Thr Gly Pro Ser Thr Ala Thr Asp Val Asn Thr Phe Val Asn 
355 360 365 

Gin Ser Lys He Asp Leu Arg Gin His Arg Gly Leu Trp Cys Asn Gin 
370 375 380 

Asn Gly Ala Gly Leu Gly Gin Pro Pro Gin Ala Ser Pro Thr Asp Phe 
385 390 395 400 

Pro Asn Ala His Leu Asp Ala Tyr Val Trp He Lys Pro Pro Gly Glu 
405 410 415 

Ser Asp Gly Thr Ser Ala Ala Ser Asp Pro Thr Thr Gly Lys Lys Ser 
420 425 430 

Asp Pro Met Cys Asp Pro Thr Tyr Thr Thr Ser Tyr Gly Val Leu Thr 
435 440 445 
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Asn Ala Leu Pro Asn Ser Pro lie Ala Gly Gin Trp Phe Pro Ala Gin 
450 455 460 

Phe Asp Gin Leu Val Ala Asn Ala Arg Pro Ala Val Pro Thr Ser Thr 
465 470 475 480 

Ser Ser Ser Pro Pro Pro Pro Pro Pro Ser Pro Ser Ala Ser Pro Ser 
485 490 495 

Pro Ser Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser 
500 505 510 

Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser 
515 520 525 

Ser Ser Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser 
530 535 540 

Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser Ser Ser 
545 550 555 560 

Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser 
565 570 575 

Pro Ser Pro Thr Ser Ser Pro Val Ser Gly Gly Leu Lys Val Gin Tyr 
580 585 590 

Lys Asn Asn Asp Ser Ala Pro Gly Asp Asn Gin lie Lys Pro Gly Leu 
595 600 605 

Gin Leu Val Asn Thr Gly Ser Ser Ser Val Asp Leu Ser Thr Val Thr 
610 615 620 

Val Arg Tyr Trp Phe Thr Arg Asp Gly Gly Ser Ser Thr Leu Val Tyr 
625 630 635 640 

Asn Cys Asp Trp Ala Ala Met Gly Cys Gly Asn He Arg Ala Ser Phe 
645 650 655 

Gly Ser Val Asn Pro Ala Thr Pro Thr Ala Asp Thr Tyr Leu Gin Leu 
660 665 670 

Ser Phe Thr Gly Gly Thr Leu Ala Ala Gly Gly Ser Thr Gly Glu lie 
675 680 685 

Gin Asn Arg Val Asn Lys Ser Asp Trp Ser Asn Phe Thr Glu Thr Asn 
690 695 700 

Asp Tyr Ser Tyr Gly Thr Asn Thr Thr Phe Gin Asp Trp Thr Lys Val 
705 710 715 720 

Thr Val Tyr Val Asn Gly Val Leu Val Trp Gly Thr Glu Pro Ser Gly 
725 730 735 

Thr Ser Pro Ser Pro Thr Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser 
740 745 750 
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Pro Gly Gly Asp Val Thr Pro Pro Ser Val Pro Thr Gly Leu Val Val 
755 760 765 

Thr Gly Val Ser Gly Ser Ser Val Ser Leu Ala Trp Asn Ala Ser Thr 
770 775 780 

Asp Asn Val Gly Val Ala His Tyr Asn Val Tyr Arg Asn Gly Val Leu 
785 790 795 800 

Val Gly Gin Pro Thr Val Thr Ser Phe Thr Asp Thr Gly Leu Ala Ala 
805 810 815 

Gly Thr Ala Tyr Thr Tyr Thr Val Ala Ala Val Asp Ala Ala Gly Asn 
820 825 830 

Thr Ser Ala Pro Ser Thr Pro Val Thr Ala Thr Thr Thr Ser Pro Ser 
835 840 845 

Pro Ser Pro Thr Pro Thr Gly Thr Thr Val Thr Asp Cys Thr Pro Gly 
850 855 860 

Pro Asn Gin Asn Gly Val Thr Ser Val Gin Gly Asp Glu Tyr Arg Val 
865 870 875 880 

Gin Thr Asn Glu Trp Asn Ser Ser Ala Gin Gin Cys Leu Thr lie Asn 
885 890 895 

Thr Ala Thr Gly Ala Trp Thr Val Ser Thr Ala Asn Phe Ser Gly Gly 
900 905 910 

Thr Gly Gly Ala Pro Ala Thr Tyr Pro Ser lie Tyr Lys Gly Cys His 
915 920 925 

Trp Gly Asn Cys Thr Thr Lys Asn Val Gly Met Pro lie Gin lie Ser 
930 935 940 

Gin lie Gly Ser Ala Val Thr Ser Trp Ser Thr Thr Gin Val Ser Ser 
945 950 955 960 

Gly Ala Tyr Asp Val Ala Tyr Asp He Trp Thr Asn Ser Thr Pro Thr 
965 970 975 

Thr Thr Gly Gin Pro Asn Gly Thr Glu lie Met He Trp Leu Asn Ser 
980 985 990 

Arg Gly Gly Val Gin Pro Phe Gly Ser Gin Thr Ala Thr Gly Val Thr 
995 1000 1005 

Val Ala Gly His Thr Trp Asn Val Trp Gin Gly Gin Gin Thr Ser Trp 
1010 1015 1020 

Lys lie lie Ser Tyr Val Leu Thr Pro Gly Ala Thr Ser lie Ser Asn 
1025 1030 1035 1040 

Leu Asp Leu Lys Ala He Phe Ala Asp Ala Ala Ala Arg Gly Ser Leu 
1045 1050 1055 
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Asn Thr Ser Asp Tyr Leu Leu Asp Val Glu Ala Gly Phe Glu lie Trp 
1060 1065 1070 

Gin Gly Gly Gin Gly Leu Gly Ser Asn Ser Phe Ser Val Ser Val Thr 
1075 1080 1085 

Ser Gly Thr Ser Ser Pro Thr Pro Ser Pro Ser Pro Thr Pro Thr Pro 
1090 1095 1100 

Ser Pro Thr Pro Thr Pro Ser Pro Ser Pro Thr Pro Ser Pro Ser Pro 
1105 1110 1115 112 

Thr Ser Ser Pro Ser Ser Ser Gly Val Ala Cys Arg Ala Thr Tyr Val 
1125 1130 1135 

Val Asn Ser Asp Trp Gly Ser Gly Phe Thr Ala Thr Val Thr Val Thr 
1140 1145 1150 

Asn Thr Gly Ser Arg Ala Thr Asn Gly Trp Thr Val Ala Trp Ser Phe 
1155 1160 1165 

Gly Gly Asn Gin Thr Val Thr Asn Tyr Trp Asn Thr Ala Leu Thr Gin 
1170 1175 1180 

Ser Gly Ala Ser Val Thr Ala Thr Asn Leu Ser Tyr Asn Asn Val lie 
1185 1190 1195 120 

Gin Pro Gly Gin Ser Thr Thr Phe Gly Phe Asn Gly Ser Tyr Ser Gly 
1205 1210 1215 

Thr Asn Ala Ala Pro Thr Leu Ser Cys Thr Ala Ser 



<210>2 
<211>3687 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Segment of 
GuxA 

<400> 2 

atggagcgaa cccaacaatc cggacggaac tgcaggtacc agagaggaac gacacgaatg 60 

cccgccatct caaaacggct gcgagccggc gtcctcgccg gggcggtgag catcgcagcc 1 20 

tccatcgtgc cgctggcgat gcagcatcct gccatcgccg cgacgcacgt cgacaatccc 1 80 

tatgcgggag cgaccttctt cgtcaacccg tactgggcgc aagaagtaca gagcgaagcg 240 

gcgaaccaga ccaatgccac tctcgcagcg aaaatgcgcg tcgtttccac atattcgacg 300 

gccgtctgga tggaccgcat cgctgcgatc aacggcgtca acggcggacc cggcttgacg 360 

acatatctgg acgccgccct ctcccagcag cagggaacca cccctgaagt cattgagatt 420 

gtcatctacg atctgccggg acgcgactgc gcggcgctcg cctccaacgg cgaactgccc 480 

gctacggcag caggtttgca gacctatgaa acgcagtaca tcgatccgat tgcgagtatc 540 

ctgagcaatc cgaagtactc cagcctgcgg atcgtgacga tcattgagcc ggactcgctg 600 

ccaaacgcgg tcaccaatat gagcattcaa gcgtgtgcaa cggcggtgcc gtattacgag 660 

caaggcatcg agtacgcgct cacgaaattg cacgccattc cgaacgtgta catctacatg 720 

gacgccgccc actccggctg gcttgggtgg cccaataatg ccagcggata cgtacaggaa 780 

gtccagaagg tcctcaacgc gagcatcggg gtcaacggca tcgacggctt cgtcaccaac 840 
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acggcgaatt acacgccgtt gaaggagccg ttcatgaccg ccacccagca ggtcggcgga 900 

cagccggtgg agtcggcgaa tttctaccag tggaatcctg acatcgacga agccgactac 960 

gcggttgact tgtactcgcg gctcgtcgcc gctggctttc caagcagcat cggcatgctc 1 020 

atcgacacct tacgcaacgg ttggggtggt ccgaacgaac caacaggccc gagcaccgcg 1080 

accgatgtca acaccttcgt caaccagtcg aagattgacc ttcggcagca ccgcggcctg 1 140 

tggtgcaacc agaacggtgc gggcctcggc cagccgccgc aggcaagccc gacggacttc 1 200 

ccgaacgcgc acctcgacgc gtatgtctgg atcaagccgc cgggtgagtc ggacggcaca 1260 

agcgctgcga gcgatccgac aactggcaag aagtcggacc ccatgtgcga cccgacgtac 1 320 

acgacgtcgt acggggtact gaccaacgcg ttaccgaact ccccgatcgc cggccagtgg 1 380 

ttcccggcgc agtttgacca gcttgtcgcg aacgcacggc cagcggtgcc gacgtcgacc 1440 

agctcgagcc cgccgcctcc gccgccgagt ccgtcggctt cgccgagtcc gagcccgagt 1 500 

ccgagcccga gcagctcgcc atcgccgtcg ccgtctccga gctcgagccc gtctccgtcg 1560 

ccgagcccga gtccgagccc gagtagctcg ccgtcgccgt ctccgagctc gagcccgtct 1620 

ccgtcgccga gcccgagtcc gagcccgagt agctcgccgt cgccgtctcc gagctcgagc 1 680 

ccgtctccgt cgccgagccc gagtccgagc ccgagtagct cgccgtcgcc gtctccgacg 1740 

tcgtcgccgg tgtcgggtgg gctgaaggtg cagtacaaga acaatgattc ggcgccgggt 1800 

gataaccaga tcaaaccggg tctccagttg gtgaataccg ggtcgtcgtc ggtggatttg i860 

tcgacggtga cggtgcggta ctggttcacc cgggatggtg ggtcgtcgac actggtgtac 1920 

aactgtgact gggcggcgat ggggtgtggg aatatccgcg cctcgttcgg ctcggtgaac 1 980 

ccggcgacgc cgacggcgga cacctacctg cagttgtcgt tcactggtgg aacgttggcc 2040 

gctggtgggt cgacgggtga gattcaaaac cgggtgaata agagtgactg gtcgaatttc 2100 

accgagacca atgactactc gtatgggacg aacaccacct tccaggactg gacgaaggtg 2160 

acggtgtacg tcaacggcgt gttggtgtgg gggactgaac cgtccggcac cagccccagc 2220 

cccacaccat ccccgagccc gagcccgagc ccgagcccgg gtggggatgt gacgccgccg 2280 

agtgtgccga ccggcttggt ggtgacgggg gtgagtgggt cgtcggtgtc gttggcgtgg 2340 

aatgcgtcga cggataacgt gggggtggcg cattacaacg tgtaccgcaa cggggtgttg 2400 

gtgggccagc cgacggtgac ctcgttcacc gacacgggtt tggccgcggg aaccgcgtac 2460 

acctacacgg tggccgcggt ggacgctgcg ggtaacacct ccgccccatc cacccccgtc 2520 

accgccacca ccacgagtcc cagccccagc cccacgccga cggggaccac ggtcaccgac 2580 

tgcacgcccg gtcctaacca gaatggtgtg accagcgtgc agggcgacga ataccgggtg 2640 

cagaccaatg agtggaattc gtcggcccag cagtgcctca ccatcaatac cgcgaccggt 2700 

gcctggacgg tgagcactgc gaacttcagc ggtgggaccg gcggtgcgcc cgcgacgtat 2760 

ccgtcgatct acaagggctg ccactggggc aactgcacca cgaagaacgt cgggatgccg 2820 

attcagatca gtcagattgg ttcggctgtg acgtcgtgga gtacgacgca ggtgtcgtcg 2880 

ggcgcgtatg acgtggccta cgacatttgg acgaacagta ccccaacgac aaccggtcag 2940 

ccaaacggta ccgaaatcat gatttggctg aattcgcgtg gtggggtgca gccgttcggg 3000 

tcgcagacag cgacgggtgt gacggtcgct ggtcacacgt ggaatgtctg gcagggtcag 3060 

cagacctcgt ggaagattat ttcctacgtc ctgacccccg gtgcgacgtc gatcagtaat 3120 

ctggatttga aggcgatttt cgcggacgcc gcggcacgcg ggtcgctcaa cacctccgat 31 80 

tacctgctcg acgttgaggc cgggtttgag atctggcaag gtggtcaggg cctgggcagc 3240 

aactcgttca gcgtctccgt gacgagcggc acgtccagcc cgacaccgag cccgagcccg 3300 

acgccgacac cgagcccgac gccgacaccg tctccgagcc cgaccccgtc gccgagtccg 3360 

accagctcgc cgtcgtcgtc gggtgtggcg tgccgggcga cgtatgtggt gaatagtgat 3420 

tggggttctg ggtttacggc gacggtgacg gtgacgaata ccgggagccg ggcgacgaac 3480 

gggtggacgg tggcgtggtc gtttggtggg aatcagacgg tcacgaacta ctggaacact 3540 

gcgttgaccc aatcaggtgc atcggtgacg gcgacgaacc tgagttacaa caacgtgatc 3600 

caaccgggtc agtcgaccac cttcggattc aacggaagtt actcaggaac aaacgccgcg 3660 

ccgacgctca gctgcacagc cagctga 3687 



<210> 3 
<211>53 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Segment of 
GuxA 



<400> 3 

Met GIu Arg Thr Gin Gin Ser Gly Arg Asn Cys Arg Tyr Gin Arg Gly 
15 10 15 

Thr Thr Arg Met Pro Ala lie Ser Lys Arg Leu Arg Ala Gly Val Leu 
20 25 30 

Ala Gly Ala Val Ser lie Ala Ala Ser Me Val Pro Leu Ala Met Gin 
35 40 45 

His Pro Ala lie Ala 
50 



<210>4 
<211>423 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Segment of 
GuxA 

<400> 4 

Ala Thr His Val Asp Asn Pro Tyr Ala Gly Ala Thr Phe Phe Val Asn 
15 10 15 

Pro Tyr Trp Ala Gin GIu Val Gin Ser GIu Ala Ala Asn Gin Thr Asn 
20 25 30 

Ala Thr Leu Ala Ala Lys Met Arg Val Val Ser Thr Tyr Ser Thr Ala 
35 40 45 

Val Trp Met Asp Arg lie Ala Ala He Asn Gly Val Asn Gly Gly Pro 
50 55 60 

Gly Leu Thr Thr Tyr Leu Asp Ala Ala Leu Ser Gin Gin Gin Gly Thr 
65 70 75 80 

Thr Pro GIu Val lie GIu lie Val He Tyr Asp Leu Pro Gly Arg Asp 
85 90 95 

Cys Ala Ala Leu Ala Ser Asn Gly GIu Leu Pro Ala Thr Ala Ala Gly 
100 105 110 

Leu Gin Thr Tyr GIu Thr Gin Tyr He Asp Pro He Ala Ser He Leu 
115 120 125 

Ser Asn Pro Lys Tyr Ser Ser Leu Arg He Val Thr He He GIu Pro 
130 135 140 

Asp Ser Leu Pro Asn Ala Val Thr Asn Met Ser Me Gin Ala Cys Ala 
145 150 155 160 

Thr Ala Val Pro Tyr Tyr GIu Gin Gly He GIu Tyr Ala Leu Thr Lys 
165 170 175 



Leu His Ala lie Pro Asn Val Tyr He Tyr Met Asp Ala Ala His Ser 
180 185 190 

Gly Trp Leu Gly Trp Pro Asn Asn Ala Ser Gly Tyr Val Gin Glu Val 
195 200 205 

Gin Lys Val Leu Asn Ala Ser lie Gly Val Asn Gly lie Asp Gly Phe 
210 215 220 

Val Thr Asn Thr Ala Asn Tyr Thr Pro Leu Lys Glu Pro Phe Met Thr 
225 230 235 240 

Ala Thr Gin Gin Val Gly Gly Gin Pro Val Glu Ser Ala Asn Phe Tyr 
245 250 255 

Gin Trp Asn Pro Asp lie Asp Glu Ala Asp Tyr Ala Val Asp Leu Tyr 
260 265 270 

Ser Arg Leu Val Ala Ala Gly Phe Pro Ser Ser lie Gly Met Leu He 
275 280 285 

Asp Thr Leu Arg Asn Gly Trp Gly Gly Pro Asn Glu Pro Thr Gly Pro 
290 295 300 

Ser Thr Ala Thr Asp Val Asn Thr Phe Val Asn Gin Ser Lys He Asp 
305 310 315 320 

Leu Arg Gin His Arg Gly Leu Trp Cys Asn Gin Asn Gly Ala Gly Leu 
325 330 335 

Gly Gin Pro Pro Gin Ala Ser Pro Thr Asp Phe Pro Asn Ala His Leu 
340 345 350 

Asp Ala Tyr Val Trp He Lys Pro Pro Gly Glu Ser Asp Gly Thr Ser 
355 360 365 

Ala Ala Ser Asp Pro Thr Thr Gly Lys Lys Ser Asp Pro Met Cys Asp 
370 375 380 

Pro Thr Tyr Thr Thr Ser Tyr Gly Val Leu Thr Asn Ala Leu Pro Asn 
385 390 395 400 

Ser Pro He Ala Gly Gin Trp Phe Pro Ala Gin Phe Asp Gin Leu Val 
405 410 415 



Ala Asn Ala Arg Pro Ala Val 
420 



<210>5 
<211> 150 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Segment of 
GuxA 



<400> 5 

Val Ser Gly Gly Leu Lys Val Gin Tyr Lys Asn Asn Asp Ser Ala Pro 
15 10 15 

Gly Asp Asn Gin lie Lys Pro Gly Leu Gin Leu Val Asn Thr Gly Ser 
20 25 30 

Ser Ser Val Asp Leu Ser Thr Val Thr Val Arg Tyr Trp Phe Thr Arg 
35 40 45 

Asp Gly Gly Ser Ser Thr Leu Val Tyr Asn Cys Asp Trp Ala Ala Met 
50 55 60 

Gly Cys Gly Asn lie Arg Ala Ser Phe Gly Ser Val Asn Pro Ala Thr 
65 70 75 80 

Pro Thr Ala Asp Thr Tyr Leu Gin Leu Ser Phe Thr Gly Gly Thr Leu 
85 90 95 

Ala Ala Gly Gly Ser Thr Gly Glu lie Gin Asn Arg Val Asn Lys Ser 
100 105 110 

Asp Trp Ser Asn Phe Thr Glu Thr Asn Asp Tyr Ser Tyr Gly Thr Asn 
115 120 125 

Thr Thr Phe Gin Asp Trp Thr Lys Val Thr Val Tyr Val Asn Gly Val 
130 135 140 

Leu Val Trp Gly Thr Glu 
145 150 



<210>6 
<211> 854043 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Segment of 
GuxA 

<400> 6 

Asp Val Thr Pro Pro Ser Val Pro Thr GIv Leu Val Val Thr GIv Val 
J 5 10 15 

Ser GIv Ser Ser Val Ser Leu Ala Trp Asn Ala Ser Thr Asp Asn Val 
20 25 30 

GIv Val Ala His Tvr Asn Val Tvr Aro Asn GIv Val Leu Val GIv Gin 
35 40 45 

Pro Thr Val Thr Ser Phe Thr Asp Thr GIv Leu Ala Ala GIv Thr Ala 
50 55 60 



Tvr Thr Tvr Thr Val Ala Ala Val Asp Ala Ala GIv Asn Thr Ser Ala 
.65 70 75 80 
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ProSerThrPro Val 
85 

Mot G l u Arg Thr CIn C I n Sor Cly Arg Acn Cyo Arg Tyr CIn Arg Cly 
^1 g . 40 4§ 

Thr Thr Arg Mot Pro Ala llo Sor Lys Arg Lou Arg A l a Gly Val Lou 
2e 3§ 3© 

Ala Cly Ala Va l Sor llo Ala Ala Sor l ie Val Pro Lou A l a Met C I n 
3§— 4e 4§ 

Hio Pro A l a ll o Ala Ala Thr H i o Va l Aop Aon Pro Tyr A l a C l y A l a 
SO s§ go 

Thr Pho Pho Val Aon Pro Tyr Trp A l a CIn Clu Vo l C I n Sor Clu A l a 

-6§ — ;0 ■ j% ^ 



Ala Asn Gin Thr Asn A l a Thr Lou Ala A l a Lyo Mot Arg Va l Val Sor 
8S —90- — 9§ 

Thr Tyr Sor Thr Ala Val Trp Mot Aop Arg l lo Ala A l a ll o Aon C l y 
4-00 40S -A^ 

Va l Asn C l y G l y Pro Cly Lou Thr Thr Tyr Lou Aop Ala Ala Leu G or 
446 430 42§ 

Gin Gin C I n Gly Thr Thr Pro Glu Val llo Glu llo Val l lo Tyr Asp 
430 43§ 440 

Lou Pro Cly Arg Aop Cyo A l a Ala Lou Ala Sor Aon C l y Clu Lou Pro 
44§ _4§e 4^ 

Ala Thr Ala Ala C l y Lou Gin Thr Tyr C l u Thr C I n Tyr l l o Aop Pro 
4€S- -470- 47§ 

He Ala Sor lie Lou Sor Aon Pro Lyo Tyr Sor Sor Lou Arg llo Val 
4«0- 4«§ 400 

Thr ll o llo Clu Pro Asp Sor Lou Pro Acn Ala Val Thr Acn Mot Sor 
4«§ 200 206 

l i e G i n Ala Cyc A l a Thr Ala Va l Pro Tyr Tyr Glu CIn G l y l lo Clu 

246 — 220 



Tyr Ala Lou Thr Lye Lou I lis Ala llo Pro Aon Val Tyr lie Tyr Met 
226^ 230 236 



240 



Asp Ala Ala His Sor C l y Trp Lou Cly Trp Pro Asn Asn Ala Sor Cly 
246 260— 266 

Tyr Val CIn Glu Val CIn Lys Va l Lou Asn A l a Sor llo Gly Va l Asn 
-see 266 270 



Gly llo Aop G l y Pho Vol Thr Aon Thr Ala Aon Tyr Thr Pro Lou Lyo 

276 280 285 

Glu Pro Pho Mot Thr Ala Thr CIn C I n Vo l C l y C l y C I n Pro Val C l u 
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■^90- 



-300 



S e r Ala Acn Pho Tyr Gin Trp Acn Pro Asp llo Aop Glu Ala Aop Tyr 

30§ 340 34€ 

Ala Vol Aop Lou Tyr Sor Arg Lou Vol Ala Ala Cly Pho Pro Sor Sor 
32§ 330 336 

llo G l y Mot Lou lie Aop Thr Lou Arg Aon Gly Trp Gly Gly Pro Aon 
340 34§ 3§0 

Glu Pro Thr Cly Pro Sor Thr Ala Thr Aop Vol Aon Thr Pho Va! Aon 
3§§ 3©0 3g§ 

Gin Sor Lyo ll o Aop Lou Arg Gin H i o Arg Gly Lou Trp Gyo Aon Gin 
370^ -37S 380 



Aon G l y A l a G l y Lou Cly Gin Pro Pro Gin Ala Sor Pro Thr Aop Pho 
38§ 390 39S 400 

Pro Asn A l a Hio Leu Aop Ala Tyr Vo l Trp l l o Lys Pro Pro C l y Clu 
40§ 44*^ 44§ 

Sor Aop Cly Thr Sor Ala Ala Sor Aop Pro Thr Thr G l y Lyo Lyo Sor 
420 42€ 430 

Aep Pro Mot Cyc Aop Pro Thr Tyr Thr Thr Sor Tyr G l y Va l Lou Thr 
43§ 440 44§ 

Asn A l a Lou Pro Aon Ser Pro llo Ala G l y G i n Trp Pho Pro A l a CIn 
4€0 4§§ 460 

Ph e Asp Gin Lou Va l A l a Aon A l a Arg Pro Ala Va l Val Sor Cly C l y 
46§ 4?0 47§ 480 

L e u Lyo Val Gin Tyr Lyo Asn Aon Aop Sor Ala Pro G l y Aop Aon CIn 
48§ 490 49S 

llo Lye Pro G l y Lou CIn Lou Vol Aon Thr G l y Sor Sor Sor Val Aop 
§00 §o§ §4© 

Lou Sor Thr Val Thr Val Arg Tyr Trp Pho Thr Arg Aep Gly Cly Sor 
§43 —§20 §2§ 

Sor Thr Lou Va l Tyr Asn Cys Aop Trp Ala Ala Mot Gly Cyo Cly Aon 
§39 §3§ §40 



l lo Arg A l a Sor Pho Cly Sor Val Aon Pro Ala Thr Pro Thr A l a Aop 
S4§ §§0 666 §60 

Thr Tyr Lou C I n Lou Sor Pho Thr C l y G l y Thr Lou A l a Ala Cly C l y 

§§§ §70 §7S 

Sor Thr Cly G l u ll o CIn Aon Arg Vo l Aon Lyo Sor Asp Trp Sor Asn 

§80 --§8§ §90 

Pho Thr Clu Thr Aon Aop Tyr Sor Tyr G l y Thr Aon Thr Thr Pho CIn 
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-800- 



-€0& 



Acp Trp Thr Lye Val Thr Va l Tyr Vo l Acn C l y Vol Lou Vol Tip Cly 
§40— — S4€ g20 

Thr Glu Aop Val Thr Pro Pro Sor Val Pro Thr C l y Lou Vol Val Thr 
§2g §30 §3§ 610 

Gly Vol Sor C l y Sor Sor Va l Sor Lou Ala Trp Aon A l a Sor Thr Aop 
64S §§e g§§ 

Asn Val G l y Va l Ala H i o Tyr Aon Vol Tyr Arg Aon Gly Val Lou Val 
§§0 §§§_ g?o 

Gly Gin Pro Thr Val Thr Sor Pho Thr Aop Thr Cly Lou Ala Ala Cly 
§80 §86 



Thr Ala Tyr Thr Tyr Thr Val Ala Ala Vol Acp Ala Ala Gly Aon Thr 
§80 §8€ TOO 

Sor Ala Pro Sor Thr Pro Val Aop Cyc Thr Pro C l y Pro Aon C I n Aon 
^ ?40 ?30 

Gly Val Thr Sor Vo l C I n Aop Cly Glu Tyr Arg Val Gin Thr Aon Clu 
— 73§ _?30 73§ 

Trp Aon Sor Sor Ala Gin Gin Cyc Lou Thr l lo Acn Thr Alo Thr Cly 
?40 74§ 7§o 

A l a Trp Thr <?f>r Thr Ain Am r> | y r;|y jhr Cly Cly Alo 
?§§ 7§0 ?§§ 

Pro A l a Thr Tyr Pro Sor llo Tyr Lyo G l y Cyo H i e Trp Gly Aon Cyo 
^ ??§ 780 

Thr Thr Lyo Acn Va l Cly Mot Pro llo C I n l lo Sor CIn l lo G l y Sor 
78S 790 ?9§ 800 

Ala Val Thr Sor Trp Sor Thr Thr Gin Va l Sor Sor G l y Ala Tyr Aop 
— 865 810 84# 

Val Ala Tyr Acp l l o Trp Thr Aon Sor Thr Pro Thr Thr Thr Cly CIn 
830 83S 830 

Pro Aon Cly Thr C l u ll o Mot ll o Trp Lou Aon Sor Arg Gly C l y Vol 
83§ 840 84§ 

Gin Pro Phc Cly Sor CIn Thr Ala Thr Gly Val Thr Val Ala Cly Hio 
8§e 8§§ 8§0 



Thr Trp Aon Va l Trp CIn Cly C I n CIn Thr Sor Trp Lyo llo l ie S or 
8§5 870 87§ 880 

Tyr Val Lou Thr Pro C l y Ala Thr Sor l l o Sor Aon Lou Aop Leu Lyo 
88S 898- -89§ 

Ala l lo Phc Ala Acp A l a A l a Ala Arg C l y Sor Lou Aon Thr Sor Aop 



-13- 



90e 90g -94© 

Tyr Lou Lou Acp Vol Clu Ala Gly Pho Glu l l o Trp Gin G l y Gly CIn 
94€ — 920 93§ 

Gly Lou G l y Sor Aon Sor Pho Sor Va l Sor Vai Thr Sor Gly G l y Vo l 
930 93§ 940 

A l a Cyo Arg A l a Thr Tyr Va l Val Aon Sor Asp Trp G l y Sor C l y Pho 
94§ 960 9gg 960 

Thr Ala Thr Va l Thr Vol Thr Aon Thr Cly Sor Arg Ala Thr Aon Gly 
96§ 970 97§ 

Trp Thr Vol Ala Trp Sor Pho Cly Gly Aon CIn Thr Va l Thr Aon Tyr 
980 986 990 



Trp Aon Thr Ala Lou Thr G i n Sor C l y Ala Sor Val Thr Ala Thr Aon 
996 4000 4005 

Lou Tyr Sor Acn Aon Va l llo Gin Pro C l y Gin Sor Thr Thr Pho Cly 
4040 4046 4020 

Pho Aon C l y Sor Tyr Sor Gly Thr Aon A l a Ala Pro Thr Lou Sor Cyo 
4026 4030 4036 4040 

Thr A l a Sor 



<210>7 
<211>231 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Segment of 
GuxA 

<400> 7 

Asp Cys Thr Pro Gly Pro Asn Gin Asn Gly Val Thr Ser Val Gin GI^Asp 
15 10 15 

AsfiGly Glu Tyr Arg Val Gin Thr Asn Glu Trp Asn Ser Ser Ala Gin Gin 
20 25 30 

Cys Leu Thr lie Asn Thr Ala Thr Gly Ala Trp Thr Val Ser Thr Ala 
35 40 45 

Asn Phe Ser Gly Gly Thr Gly Gly Ala Pro Ala Thr Tyr Pro Ser lie 
50 55 60 

Tyr Lys Gly Cys His Trp Gly Asn Cys Thr Thr Lys Asn Val Gly Met 
65 70 75 80 

Pro lie Gin lie Ser Gin lie Gly Ser Ala Val Thr Ser Trp Ser Thr 
85 90 95 
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Thr Gin Val Ser Ser Gly Ala Tyr Asp Val Ala Tyr Asp lie Tip Thr 
100 105 110 

Asn Ser Thr Pro Thr Thr Thr Gly Gin Pro Asn Gly Thr Glu lie Met 
115 120 125 

He Trp Leu Asn Ser Arg Gly Gly Val Gin Pro Phe Gly Ser Gin Thr 
130 135 140 

Ala Thr Gly Val Thr Val Ala Gly His Thr Trp Asn Val Trp Gin Gly 
145 150 155 160 

Gin Gin Thr Ser Trp Lys Me lie Ser Tyr Val Leu Thr Pro Gly Ala 
165 170 175 

Thr Ser lie Ser Asn Leu Asp Leu Lys Ala lie Phe Ala Asp Ala Ala 
180 185 190 

Ala Arg Gly Ser Leu Asn Thr Ser Asp Tyr Leu Leu Asp Val Glu Ala 
195 200 205 

Gly Phe Glu lie Trp Gin Gly Gly Gin Gly Leu Gly Ser Asn Ser Phe 
210 215 220 

Ser Val Ser Val Thr Ser Gly 
225 230 



<210> 8 
<211> 101 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Segment of 
GuxA 

<400> 8 

Gly Val Ala Cys Arg Ala Thr Tyr Val Val Asn Ser Asp Trp Gly Ser 
15 10 15 

Gly Phe Thr Ala Thr Val Thr Val Thr Asn Thr Gly Ser Arg Ala Thr 
20 25 30 

Asn Gly Trp Thr Val Ala Trp Ser Phe Gly Gly Asn Gin Thr Val Thr 
35 40 45 

Asn Tyr Trp Asn Thr Ala Leu Thr Gin Ser Gly Ala Ser Val Thr Ala 
50 55 60 

Thr Asn Leu Ser TvrT vi^^ Asn Asn Val lie Gin Pro Gly Gin Ser Thr Thr 
65 70 75 80 

Phe Gly Phe Asn Gly Ser Tyr Ser Gly Thr Asn Ala Ala Pro Thr Leu 
85 90 95 

Ser Cys Thr Ala Ser 
100 



-15- 



<210> 9 
<211>423 
<212> PRT 

<213> Acidothermus cellulolyticus 
<400> 9 

Ala Thr His Val Asp Asn Pro Tyr Ala Gly Ala Thr Phe Phe Va! Asn 
15 10 15 

Pro Tyr Trp Ala Gin Glu Val Gin Ser Glu Ala Ala Asn Gin Thr Asn 
20 25 30 

Ala Thr Leu Ala Ala Lys Met Arg Val Val Ser Thr Tyr Ser Thr Ala 
35 40 45 

Val Trp Met Asp Arg lie Ala Ala lie Asn Gly Val Asn Gly Gly Pro 
50 55 60 

Gly Leu Thr Thr Tyr Leu Asp Ala Ala Leu Ser Gin Gin Gin Gly Thr 
65 70 75 80 

Thr Pro Glu Val He Glu lie Val lie Tyr Asp Leu Pro Gly Arg Asp 
85 90 95 

Cys Ala Ala Leu Ala Ser Asn Gly Glu Leu Pro Ala Thr Ala Ala Gly 
100 105 110 

Leu Gin Thr Tyr Glu Thr Gin Tyr He Asp Pro lie Ala Ser He Leu 
115 120 125 

Ser Asn Pro Lys Tyr Ser Ser Leu Arg He Val Thr He He Glu Pro 
130 135 140 

Asp Ser Leu Pro Asn Ala Val Thr Asn Met Ser He Gin Ala Cys Ala 
145 150 155 160 

Thr Ala Val Pro Tyr Tyr Glu Gin Gly He Glu Tyr Ala Leu Thr Lys 
165 170 175 

Leu His Ala He Pro Asn Val Tyr He Tyr Met Asp Ala Ala His Ser 
180 185 190 

Gly Trp Leu Gly Trp Pro Asn Asn Ala Ser Gly Tyr Val Gin Glu Val 
195 200 205 

Gin Lys Val Leu Asn Ala Ser He Gly Val Asn Gly He Asp Gly Phe 
210 215 220 

Val Thr Asn Thr Ala Asn Tyr Thr Pro Leu Lys Glu Pro Phe Met Thr 
225 230 235 240 

Ala Thr Gin Gin Val Gly Gly Gin Pro Val Glu Ser Ala Asn Phe Tyr 
245 250 255 

Gin Trp Asn Pro Asp He Asp Glu Ala Asp Tyr Ala Val Asp Leu Tyr 
260 265 270 
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Ser Arg Leu Val Ala Ala Gly Phe Pro Ser Ser lie Gly Met Leu lie 
275 280 285 

Asp Thr Leu Arg Asn Gly Trp Gly Gly Pro Asn Glu Pro Thr Gly Pro 
290 295 300 

Ser Thr Ala Thr Asp Val Asn Thr Phe Val Asn Gin Ser Lys lie Asp 
305 310 315 320 

Leu Arg Gin His Arg Gly Leu Trp Cys Asn Gin Asn Gly Ala Gly Leu 
325 330 335 

Gly Gin Pro Pro Gin Ala Ser Pro Thr Asp Phe Pro Asn Ala His Leu 
340 345 350 



Asp Ala Tyr Val Trp lie Lys Pro Pro Gly Glu Ser Asp Gly Thr Ser 
355 360 365 

Ala Ala Ser Asp Pro Thr Thr Gly Lys Lys Ser Asp Pro Met Cys Asp 
370 375 380 

Pro Thr Tyr Thr Thr Ser Tyr Gly Val Leu Thr Asn Ala Leu Pro Asn 
385 390 395 400 

Ser Pro lie Ala Gly Gin Trp Phe Pro Ala Gin Phe Asp Gin Leu Val 
405 410 415 

Ala Asn Ala Arg Pro Ala Val 
420 



<210> 10 
<211>430 
<212> PRT 

<213>Cellulomonasfimi 
<400> 10 

Ala Pro Val His Val Asp Asn Pro Tyr Ala Gly Ala Val Gin Tyr Val 
15 10 15 

Asn Pro Thr Trp Ala Ala Ser Val Asn Ala Ala Ala Gly Arg Gin Ser 
20 25 30 

Ala Asp Pro Ala Leu Ala Ala Lys Met Arg Thr Val Ala Gly Gin Pro 
35 40 45 

Thr Ala Val Trp Met Asp Arg lie Ser Ala lie Thr Gly Asn Ala Asp 
50 55 60 

Gly Asn Gly Leu Lys Phe His Leu Asp Asn Ala Val Ala Gin Gin Lys 
65 70 75 80 

Ala Ala Gly Val Pro Leu Val Phe Asn Leu Val lie Tyr Asp Leu Pro 
85 90 95 

Gly Arg Asp Cys Phe Ala Leu Ala Ser Asn Gly Glu Leu Pro Ala Thr 
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100 105 110 

Asp Ala Gly Leu Ala Arg Tyr Lys Ser Glu Tyr lie Asp Pro lie Ala 
115 120 125 

Asp Leu Leu Asp Asn Pro Glu Tyr Glu Ser lie Arg lie Ala Ala Thr 
130 135 140 

lie Glu Pro Asp Ser Leu Pro Asn Leu Thr Thr Asn lie Ser Glu Pro 
145 150 155 160 

Ala Cys Gin Gin Ala Ala Pro Tyr Tyr Arg Gin Gly Val Lys Tyr Ala 
165 170 175 

Leu Asp Lys Leu His Ala lie Pro Asn Val Tyr Asn Tyr lie Asp lie 
180 185 190 

Gly His Ser Gly Trp Leu Gly Trp Asp Ser Asn Ala Gly Pro Ser Ala 
195 200 205 

Thr Leu Phe Ala Glu Val Ala Lys Ser Thr Thr Ala Gly Phe Ala Ser 
210 215 220 

He Asp Gly Phe Val Ser Asp Val Ala Asn Thr Thr Pro Leu Glu Glu 
225 230 235 240 

Pro Leu Leu Ser Asp Ser Ser Leu Thr lie Asn Asn Thr Pro lie Arg 
245 250 255 

Ser Ser Lys Phe Tyr Glu Trp Asn Phe Asp Phe Asp Glu lie Asp Tyr 
260 265 270 

Thr Ala His Met His Arg Leu Leu Val Ala Ala Gly Phe Pro Ser Ser 
275 280 285 

lie Gly Met Leu Val Asp Thr Ser Arg Asn Gly Trp Gly Gly Pro Asn 
290 295 300 

Arg Pro Thr Ser lie Thr Ala Ser Thr Asp Val Asn Ala Tyr Val Asp 
305 310 315 320 

Ala Asn Arg Val Asp Arg Arg Val His Arg Gly Ala Trp Cys Asn Pro 
325 330 335 

Leu Gly Ala Gly He Gly Arg Phe Pro Glu Ala Thr Pro Ser Gly Tyr 
340 345 350 

Ala Ala Ser His Leu Asp Ala Phe Val Trp He Lys Pro Pro Gly Glu 
355 360 365 

Ser Asp Gly Ala Ser Thr Asp lie Pro Asn Asp Gin Gly Lys Arg Phe 
370 375 380 

Asp Arg Met Cys Asp Pro Thr Phe Val Ser Pro Lys Leu Asn Asn Gin 
385 390 395 400 

Leu Thr Gly Ala Thr Pro Asn Ala Pro Leu Ala Gly Gin Trp Phe Glu 
405 410 415 
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Glu Gin Phe Val Thr Leu Val Lys Asn Ala Tyr Pro Val He 

420 425 430 



<210> 11 
<211>432 
<212> PRT 

<213> Thermobifida fusca 
<400> 11 

Pro Gly Gly Pro Thr Asn Pro Pro Thr Asn Pro Gly Glu Lys Val Asp 
15 10 15 

Asn Pro Phe Glu Gly Ala Lys Leu Tyr Val Asn Pro Val Trp Ser Ala 
20 25 30 

Lys Ala Ala Ala Glu Pro Gly Gly Ser Ala Val Ala Asn Glu Ser Thr 
35 40 45 

Ala Val Trp Leu Asp Arg lie Gly Ala lie Glu Gly Asn Asp Ser Pro 
50 55 60 

Thr Thr Gly Ser Met Gly Leu Arg Asp His Leu Glu Glu Ala Val Arg 
65 70 75 80 

Gin Ser Gly Gly Asp Pro Leu Thr lie Gin Val Val He Tyr Asn Leu 
85 90 95 

Pro Gly Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Leu Gly Pro 
100 105 110 

Asp Glu Leu Asp Arg Tyr Lys Ser Glu Tyr lie Asp Pro lie Ala Asp 
115 120 125 

lie Met Trp Asp Phe Ala Asp Tyr Glu Asn Leu Arg He Val Ala lie 
130 135 140 

lie Glu lie Asp Ser Leu Pro Asn Leu Val Thr Asn Val Gly Gly Asn 
145 150 155 160 

Gly Gly Thr Glu Leu Cys Ala Tyr Met Lys Gin Asn Gly Gly Tyr Val 
165 170 175 

Asn Gly Val Gly Tyr Ala Leu Arg Lys Leu Gly Glu He Pro Asn Val 
180 185 190 

Tyr Asn Tyr He Asp Ala Ala His His Gly Trp He Gly Trp Asp Ser 
195 200 205 

Asn Phe Gly Pro Ser Val Asp He Phe Tyr Glu Ala Ala Asn Ala Ser 
210 215 220 

Gly Ser Thr Val Asp Tyr Val His Gly Phe He Ser Asn Thr Ala Asn 
225 230 235 240 

Tyr Ser Ala Thr Val Glu Pro Tyr Leu Asp Val Asn Gly Thr Val Asn 
245 250 255 
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Gly Gin Leu lie Arg Gin Ser Lys Tip Val Asp Trp Asn Gin Tyr Val 
260 265 270 

Asp Glu Leu Ser Phe Val Gin Asp Leu Arg Gin Ala Leu lie Ala Lys 
275 280 285 

Gly Phe Arg Ser Asp lie Gly Met Leu lie Asp Thr Ser Arg Asn Gly 
290 295 300 

Trp Gly Gly Pro Asn Arg Pro Thr Gly Pro Ser Ser Ser Thr Asp Leu 
305 310 315 320 

Asn Thr Tyr Val Asp Glu Ser Arg He Asp Arg Arg lie His Pro Gly 
325 330 335 



Asn Trp Cys Asn Gin Ala Gly Ala Gly Leu Gly Glu Arg Pro Thr Val 
340 345 350 

Asn Pro Ala Pro Gly Val Asp Ala Tyr Val Trp Val Lys Pro Pro Gly 
355 360 365 

Glu Ser Asp Gly Ala Ser Glu Glu lie Pro Asn Asp Glu Gly Lys Gly 
370 375 380 

Phe Asp Arg Met Cys Asp Pro Thr Tyr Gin Gly Asn Ala Arg Asn Gly 
385 390 395 400 

Asn Asn Pro Ser Gly Ala Leu Pro Asn Ala Pro He Ser Gly His Trp 
405 410 415 

Phe Ser Ala Gin Phe Arg Glu Leu Leu Ala Asn Ala Tyr Pro Pro Leu 
420 425 430 



<210> 12 
<211>221 
<212> PRT 

<213> Acidothennus cellulolyticus 
<400> 12 

Asn Gin Gin He Cys Asp Arg Tyr Gly Thr Thr Thr He Gin Asp Arg 
15 10 15 

Tyr Val Val Gin Asn Asn Arg Trp Gly Thr Ser Ala Thr Gin Cys He 
20 25 30 

Asn Val Thr Gly Asn Gly Phe Glu He Thr Gin Ala Asp Gly Ser Val 
35 40 45 

Pro Thr Asn Gly Ala Pro Lys Ser Tyr Pro Ser Val Tyr Asp Gly Cys 
50 55 60 

His Tyr Gly Asn Cys Ala Pro Arg Thr Thr Leu Pro Met Arg He Ser 
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65 70 75 80 

Ser He Gly Ser Ala Pro Ser Ser Val Ser Tyr Arg Tyr Thr Gly Asn 
85 90 95 

Gly Val Tyr Asn Ala Ala Tyr Asp He Trp Leu Asp Pro Thr Pro Arg 
100 105 110 

Thr Asn Gly Val Asn Arg Thr Glu lie Met He Trp Phe Asn Arg Val 
115 120 125 

Gly Pro Val Gin Pro He Gly Ser Pro Val Gly Thr Ala His Val Gly 
130 135 140 

Gly Arg Ser Trp Glu Val Trp Thr Gly Ser Asn Gly Ser Asn Asp Val 
145 150 155 160 

He Ser Phe Leu Ala Pro Ser Ala He Ser Ser Trp Ser Phe Asp Val 
165 170 175 

Lys Asp Phe Val Asp Gin Ala Val Ser His Gly Leu Ala Thr Pro Asp 
180 185 190 

Trp Tyr Leu Thr Ser He Gin Ala Gly Phe Glu Pro Trp Glu Gly Gly 
195 200 205 

Thr Gly Leu Ala Val Asn Ser Phe Ser Ser Ala Val Asn 
210 215 220 



<210> 13 
<211>221 
<212> PRT 

<213> Streptomyces sp. 
<400> 13 

Asp Thr Thr He Cys Glu Pro Phe Gly Thr Thr Thr He Gin Gly Arg 
15 10 15 

Tyr Val Val Gin Asn Asn Arg Trp Gly Ser Thr Ala Pro Gin Cys Val 
20 25 30 

Thr Ala Thr Asp Thr Gly Phe Arg Val Thr Gin Ala Asp Gly Ser Ala 
35 40 45 

Pro Thr Asn Gly Ala Pro Lys Ser Tyr Pro Ser Val Phe Asn Gly Cys 
50 55 60 

His Tyr Thr Asn Cys Ser Pro Gly Thr Asp Leu Pro Val Arg Leu Asp 
65 70 75 80 

Thr Val Ser Ala Ala Pro Ser Ser He Ser Tyr Gly Phe Val Asp Gly 
85 90 95 

Ala Val Tyr Asn Ala Ser Tyr Asp He Trp Leu Asp Pro Thr Ala Arg 
100 105 110 

Thr Asp Gly Val Asn Gin Thr Glu lie Met He Trp Phe Asn Arg Val 
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115 120 125 

Gly Pro lie Gin Pro lie Gly Ser Pro Val Gly Thr Ala Ser Val Gly 
130 135 140 

Gly Arg Thr Trp Glu Val Trp Ser Gly Gly Asn Gly Ser Asn Asp Val 
145 150 155 160 

Leu Ser Phe Val Ala Pro Ser Ala lie Ser Gly Trp Ser Phe Asp Val 
165 170 175 

Met Asp Phe Val Arg Ala Thr Val Ala Arg Gly Leu Ala Glu Asn Asp 
180 185 190 

Trp Tyr Leu Thr Ser Val Gin Ala Gly Phe Glu Pro Trp Gin Asn Gly 
195 200 205 

Ala Gly Leu Ala Val Asn Ser Phe Ser Ser Thr Val Glu 
210 215 220 



<210> 14 
<211>228 
<212> PRT 

<213> Streptomyces lividans 
<400> 14 

Cys Thr Pro Gly Pro Asn Gin Asn Gly Val Thr Ser Val Gin Gly Asp 
15 10 15 

Glu Tyr Arg Val Gin Thr Asn Glu Trp Asn Ser Ser Ala Gin Gin Cys 
20 25 30 

Leu Thr lie Asn Thr Ala Thr Gly Ala Trp Thr Val Ser Thr Ala Asn 
35 40 45 

Phe Ser Gly Gly Thr Gly Gly Ala Pro Ala Thr Tyr Pro Ser lie Tyr 
50 55 60 

Lys Gly Cys His Trp Gly Asn Cys Thr Thr Lys Asn Val Gly Met Pro 
65 70 75 80 

lie Gin He Ser Gin He Gly Ser Ala Val Thr Ser Trp Ser Thr Thr 
85 90 95 

Gin Val Ser Ser Gly Ala Tyr Asp Val Ala Tyr Asp He Trp Thr Asn 
100 105 110 

Ser Thr Pro Thr Thr Thr Gly Gin Pro Asn Gly Thr Glu He Met lie 
115 120 125 

Trp Leu Asn Ser Arg Gly Gly Val Gin Pro Phe Gly Ser Gin Thr Ala 
130 135 140 

Thr Gly Val Thr Val Ala Gly His Thr Trp Asn Val Trp Gin Gly Gin 
145 150 155 160 

Gin Thr Ser Trp Lys lie lie Ser Tyr Val Leu Thr Pro Gly Ala Thr 
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165 170 175 

Ser lie Ser Asn Leu Asp Leu Lys Ala lie Phe Ala Asp Ala Ala Ala 
180 185 190 

Arg Gly Ser Leu Asn Thr Ser Asp Tyr Leu Leu Asp Val Glu Ala Gly 
195 200 205 

Phe Glu lie Trp Gin Gly Gly Gin Gly Leu Gly Ser Asn Ser Phe Ser 
210 215 220 

Val Ser Val Thr 
225 



